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Abstract 

We  describe  a  commutative  Thue  system  that  has  ~2n  variables  and 
0(n)  rules,  each  rule  of  length  d  +  0(1)  and  that  counts  to  d'  in  a  certain 
technical  sense.  This  gives  a  more  efficient  alternative  to  a  well-known  con- 
struction of  Mayr  and  Meyer.  Such  constructions  are  used  to  derive  exponen- 
tial space  lower  bounds  for  the  word  problem  for  commutative  Thue  systems, 
but  more  recently  have  been  used  to  yield  double-exponential  degree  bounds 
for  Grobner  bases  and  syzgies. 


1      Introduction 


Mayr  and  Meyer  [4]  proved  that  the  uniform  word  problem  for  commutative  Thue  sys- 
tems is  exponential  space  complete.  The  key  to  their  proof  is  the  construction  of  a 
commutative  Thue  system  Sm  that  "counts"  to  2^  in  this  sense:  there  are  distinguished 
variables  A,  Qo  and  Qoo  such  that  if 

Qo  ^  w  (mod  Sm) 

and  Qoo  occurs  in  w  then  necessarily  w  =  QooA  ■  (These  notations  are  defined  in  the 
next  section.)  Their  construction  Sm  has  been  the  basis  of  several  other  lower  bounds 
proofs  in  polynomial  ideal  theory  [1][3][5]. 

Although  the  original  construction  of  Mayr-Meyer  uses  14n  variables,  various  authors 
(including  Lazard,  Bayer- Stillman)  have  noticed  that  lOn  variables  suffice  without  any 
essential  change.  A  nice  exposition  of  the  Mayr-Meyer  construction  has  been  given 
by  Bayer- Stillman  [1].  In  this  paper  we  give  a  more  efficient  alternative  to  the  Mayr- 
Meyer  construction,  using  only  ~  2n  variables.  We  remark  that  it  is  of  some  interest 
to  try  reduce  the  number  of  variables,  especially  to  ~  n  variables,  since  this  could 
lead  to  matching  upper  and  lower  bounds  for  the  maximum  degree  of  reduced  Grobner 
basis.  The  sharpest  upper  bound  to  date  has  been  shown  by  Dube  [2].  In  particular, 
Dube  proved  that  the  reduced  Grobner  baisis  of  a  set  of  polynomials  on  n  variables 
has  maximum  degree  bounded  by  (P"  where  d  is  a  bound  on  the  degree  of  the  original 
polynomials. 

In  view  of  the  applications  of  such  constructions,  it  is  important  to  find  simple-to- 
understand  constructions.  At  least  the  bcisic  form  of  our  construction  (section  3,  com- 
pressed rules)  has  this  property.  Our  construction  as  well  as  the  Meyer-Mayr  construc- 
tion can  be  viewed  as  the  attempt  to  convert  what  is  essentially  a  reversible  nondeter- 
ministic  computing  device  (as  represented  by  a  Thue  system)  into  a  deterministic  one  - 
such  constructions  go  back  to  Post  [6]. 

This  paper  is  organized  as  follows:  Section  2  establishes  some  terminology.  In  section 
3,  we  describe  a  Thue  system  So  that  counts  up  to  cP  ,  and  section  4  proves  that  So 
has  certain  uniqueness  properties.  Sections  5  to  7  give  successive  modifications  to  5o 
until  the  final  Soo  which  uses  2n  -f  0{y/n)  variables  and  has  0{n)  rules  each  of  size 
d  +  0(1).  The  correctness  of  5oo  is  made  transparent  by  the  technique  of  successively 
"embedding"  a  system  5,-  with  certain  desired  properties  into  another  system  5,+i  with 
the  result  that  Si+i  has  analogous  properties.  This  approach  also  has  the  advantage  of 
exposing  the  different  ideas  that  go  into  the  final  Thue  system.  We  conclude  in  section 
8,  using  52  to  show  that  the  ideal  membership  and  minimal  syszgy  problems  can  have 
double  exponential  degrees  in  the  worst  case. 


2      Preliminaries 


Let  E  =  {Xi, . . .  ,Xn]  be  an  alphabet  where  each  X,  is  called  a  (commutative)  variable. 
Let  E®  denote  the  free  commutative  monoid  generated  by  E.  An  element  w  of  E®  is 
called  a  (commutative)  word  and  w  can  be  regarded  as  a  function 

u;  :  E  ->  N 

where  N  is  the  set  of  non-negative  integers.  U  X  £  T,,  we  call  w{X)  the  X-degree  of  k; 
and  denote  it  by  deg;^(ii;).  We  prefer  to  write  w  in  the  form 

1  =  1 

A  semi-Thue  system  (over  E)  is  a  finite  set  S  of  pairs  in  E®.  Each  pair  (q,/9)  G  5  is 
called  a  rule  (or  production).  5  is  called  a  Thue  system  if  {oc,0)  €  5  imphes  (/3,  q)  G  5. 
Call  (/?,  a)  the  reverse  of  (a,^).  Also  /3  and  a  (respectively)  are  called  the  precondition 
and  postcondition  of  the  rule  (a,/3). 

Given  commutative  words  v,w,  we  write 

V  —V  u;(mod  5) 

if  for  some  (a, /3)  G  5  and  7  G  E®,  we  have  v  =  70  and  it;  =  7/?.  We  also  call  "y  — ►  ly" 
a  transition  (of  5).  The  reflexive,  transitive  closure  of  — >  (mod5)  is  denoted  A  (mod5). 
A  sequence  {wi,W2,  ■ .  ■  ,Wk)  {k  >  2,iu,  G  E®)  is  called  a  derivation  of  5  from  w^  to  u;^  if 
ly,-  —y  Wi^i{modS)  for  i  =  1, ... ,/:  —  !.  The  derivation  has  a  repetition  if  u;,  =  Wj  for  some 
i  <  j;  the  repetition  is  trivial  if  i  —  j  —  2.  A  derivation  is  repetition-free  (resp.,  simple)  if 
it  has  no  repetition  (resp.,  non-trivial  repetition).  A  derivation  from  w\  to  Wk  is  said  to 
be  unique  if  it  is  repetition-free  and  there  are  no  other  repetition-free  derivations  from 

W\   to  Wk. 

In  this  paper,  we  are  interested  in  derivations  from  a  distinguished  word  u»o  associated 
with  S  which  we  call  its  initial  assertion.  All  words  that  can  be  derived  from  the  initial 
assertion  are  called  assertions  of  S.  (This  terminology  iifrom  Post[6].) 

Let  r  C  E  and  w,v  £  E®.  Define  degp(ti;)  to  be  J2xer'^^S>xi^)-  If  T  =  E,  then 
deg^{w)  is  simply  denoted  deg{w).  The  unique  word  u>  G  E®  with  deg(iL')  =  0  is 
denoted  (without  fear  of  confusion)  by  "1".  We  say  w  is  linear  in  T  if  degp(u')  =  L 
We  say  w  divides  v,  denoted  u;  |  u,  if  for  all  X  G  E,deg;f  (u))  <  deg_jf  (y).  If  10  does  not 
divide  v,  we  write  lu  'f  u. 

Let  (5,  E)  be  a  commutative  Thue  System,  E  =  {Xi, . . .  ,Xn}-  Let  R  denote  the 
polynomial  ring  Q[E]  =  Q[Xi, . . . ,  A'"„]  where  Q  is  the  field  of  rational  numbers.  For 
each  rule  q  — >  ^  G  5,  we  form  the  polynomial  /3  — q  G  R,  and  let  Fs  denote  the  resulting 
set  of  polynomials. 


The  word  problem  (for  commutative  Thue  systems)  is  this:  Given  (5,  a,/3)  where  q,/3 
are  commutative  words  over  E,  we  want  to  know  if  o;  A  ^{modS).  The  ideal  membership 
problem  is  this:  Given  {F,f)  where  F  C  R  h  a.  finite  set  of  polynomials  and  /  €  i?,  we 
waoit  to  know  if  /  G  (F)  where  {F)  C  i?  is  the  ideal  generated  by  F.  As  shown  in  [4], 
the  word  problem  can  be  reduced  to  the  ideal  membership  problem  as  follows: 

a  A  ^(mod5)  iff  ^  -  a  €  (Fs). 

Let  F  C.  R  and  suppose  we  fix  some  arbitrary  order  on  the  elements  in  F,  and  write 
F  =  (/!,...,/„,).  Then  a  syzygy  of  F  (or  simply,  F)  is  g  =  {g^,. . .  ,gm)  G  R""  such 
that  Yl^iLi  fi9i  —  0-  Clearly  the  set  of  syzygies  form  an  i?-module  in  the  natural  way. 
The  degree  of  g  is  maxi=i_ ..  „,{deg(5,)},  and  written  deg(5).  A  syzygy  g  is  minimal  if 
it  cannot  be  written  as  the  (componentwise)  sum  of  two  syzygies,  g  =  g'  -\-  g"  where 
deg(7)  >  0,deg(^)  >  0  and  GCD{g)  =  1. 


3     First  steps:  the  compressed  rules 

We  give  a  simple  construction  of  a  "quadratic  counter"  that  will  subsequently  be  im- 
proved upon. 

Constants.  Throughout  this  construction,  we  fix  the  integers  n  >  1  and  d  >  2.  We 
also  define  ej(/c)  =  e(^)   :=  d^    for  all  k  >  0.  Observe  that  e{k  -f  1)  =  e(/r)^. 

Variables.  There  are  three  types  of  variables. 

1.  State  variables:  Qstart  ,  Qfinish 

2.  Accumulator  variables:     A^^Bk,  (fc  =  0, 1, . . .  ,n) 

3.  ^  Flag  variables:  Ffc[color]  (A;  =  1,..  .,n;  color  G  {inc,  dec,  pass}). 

Let  Eo  denote  this  set  of  5n  -|-  4  variables.  Except  for  the  state  variables,  the  others 
each  belong  to  some  level  which  is  an  integer  from  0  to  n.  We  are  mainly  interested  in 
commutative  words  of  the  following  form: 

w  =  QA^^B^°  n  A-^'Bl^FkicoloTk] 

where  Q  '\s  a.  state  variable  and  mo,no,  77ifcj^*:  ^  0.  We  call  such  words  well-formed. 

Note  in  particular,  a  well-formed  word  is  linear  in  {Qstart  ■,  Qf  inish)  ^.nd  linear  in 
{Ffc[inc],Fjt[dec],Fjt[pass]}  for  each  /:  =  l,...,n. 


For  1  <  A:  <  n  and  A:<£<n  +  l,we  use  the  abbreviation: 

Fk,t[coloT]  =  Yl  F, [color] 
.=* 

Thus  Fjt.it [color]  =  1.  The  following  well-formed  word  is  designated  the  initial  assertion 

Wq   :=   Q start K^hn+i[^^c]. 

The  rules  which  we  will  next  specify  ensure  that  all  assertions  are  well-formed.  Further- 
more, the  flag  variables  at  each  level  cycle  through  the  sequence 

inc  =:^  pass  =^  dec  ^=>  inc  =>  •  •  • 

We  now  present  the  rules,  which  naturally  fall  under  two  groups: 

Start  Rules,  {k  =  1,2, ...  ,n) 
(Sl)fc     Ac                Qst^F,,k[v^ss]Fk[xnc]      BoAf        ("increment  rule") 
(S2),     AoBt            Qstarti^i.A:[pass]n[dec]^     ^^  ("decrement  rule") 

(S3)       Qstart^o^    ►    ^finish     ("finish  rule") 

Finish  Rules.  (A;  =  1,2, ...  ,n  —  1) 

(Fl)fc     Qfiiiish^i,fc[dec]F;t[inc]         ^  (53tart^-Fi.jt[inc]Ffc[pass] 

("inc  =>  pass  rule") 

(F2),     Qfi^isbF,A<i^c]A,  Ffc[pass]  ^  Q^^^^AiF,Ai^c]B, 

("pass  =^  pass  rule") 

(F3),     (3finishi^i,*[dec]F*[pass]/lfc .  g,t^t>l^Fi.fc[inc]n[dec]Bt 

("pass  =>  dec  rule") 

Let  (5o,  So)  denote  the  Thue  system  corresponding  to  these  rules. 

Remark:  We  write  the  rule  (q,^)  as  a  — >  /?  above  to  be  suggestive  of  the  "forward" 
direction  of  applying  the  rules.  But  one  must  remember  that  we  are  describing  a  Thue 
system,  so  the  reverse  rule  /?  — ►  or  is  also  implied.  A  derivation  that  only  uses  the 
forward  (resp.  reverse)  rules  will  be  called  a /o ru;a nf  (resp.  reverse)  derivation;  otherwise 
the  derivation  is  mixed.  Furthermore,  if  07  — >  ^7,  we  may  write  the  rule  as 

-r 
Q  >  13 


as  in  (SI)  and  (S2).  (We  call  (Si),  for  instance,  a  rule  even  though  it  is  really  a  family 
of  n  rules  parameterized  by  k\  this  abuse  of  language  should  be  harmless.)  Since  7  is 
unchanged,  we  call  it  a  "catalyst"  for  the  rule.  In  this  section,  all  transitions  — >  ,  A  are 
understood  to  be  (mod5o). 

Let  us  briefly  comment  on  the  rules:  the  start  rules  are  apphcable  only  if  we  axe  in  the 
state  Qstart-  Rule  (Si)  says  that  the  accumulator  variable  Aq  can  convert  to  Bq  in  the 
presence  of  (the  catalyst)  Qstart -fi,Jt[pa-ss]F;t[inc],  at  the  same  time  producing  d  copies 
of  Ak-  We  see  that  the  "pass"  flags  F,[pass]  at  levels  i=-lioi  =  k  —  l  are  used  to  signal 
that  the  accumulators  Ai  should  be  ignored,  and  the  flag  Fi[inc]  signals  that  Ak  should 
be  "incremented".  Rule  (S2)  is  the  counterpart  of  (Si)  where  the  flag  Ffc[dec]  signals 
the  "decrementing"  of  accumulator  Bk.  For  instance,  if  we  start  with  d  copies  of  Aq, 
and  the  catalyst  for  rule  (Si)  is  present  (this  is  the  ca^e  with  the  initial  assertion  ioq), 
then  we  may  apply  rule  (Si)  (i  times.  This  converts  A^  to  Bq.  Which  brings  us  to  rule 
(S3)  -  this  rule  is  now  applicable  -  and  we  end  up  in  the  finish  state  (^f  inish-  Of  course 
by  the  time  we  finished,  we  managed  to  create  e(l)  =  d?  copies  of  Ak-  Conversely,  if 
we  managed  to  apply  (S2)  d  times  before  applying  (S3),  we  would  have  destroyed  e(l) 
copies  oi  Bk- 

Let  us  similarly  note  the  sahent  features  of  the  finish  rules:  in  each  case,  the  state 
Qfinish  transforms  to  Qstart^o-  Furthermore,  the  initial  block  of  "decrement  flags" 
Fi,/:[dec]  is  converted  to  Fj  fc[inc].  Hence  the  differences  among  the  finish  rules  lie  in 
the  flag  at  level  k:  in  case  of  (Fl),  the  flag  is  i^jt[inc]  and  we  convert  it  to  F/t[pass].  In 
(F2),  the  flag  F^tfpass]  acts  only  as  a  catalyst  for  converting  an  Ak  to  Bk-  In  (F3),  we 
convert  F;t[pass]  to  Ft[dec],  and  at  the  same  time  convert  an  Ak  to  a  Bk- 

Evidently,  both  (F2)  and  (F3)  have  the  same  preconditions.  This  introduces  a  non- 
deterministic  choice  in  derivations.  However,  we  shall  see  that  this  choice  is  forced  if 
we  want  the  longest  possible  derivation.  Again,  if  we  look  at  the  reverse  of  the  rules, 
we  notice  that  the  precondition  for  the  reverse  of  rule  (F2)  subsumes  the  precondition 
for  the  reverse  of  (Fl).  We  shall  analyze  the  consequences  of  these  non-deterministic 
choices  later. 

We  see  here  that  the  flags  at  each  level  are  transformed  by  these  finish  rules  in  the  cycle 
(inc  =^  pass  =>  dec  =>  inc  =>...)  as  noted  earher.  Of  course,  one  must  remember 
that  these  rules  are  reversible  (we  are  describing  a  Thue  System  here)  so  this  cychc 
nature  needs  to  be  stated  more  precisely:  in  any  forward  derivation,  the  flags  at  each 
level  transform  cyclically  as  indicated. 

Finally,  observe  that  the  rules  preserve  well-formedness  of  words.  In  particular,  all 
assertions  are  well-formed. 

To  see  these  rules  in  action,  it  is  best  to  follow  the  proof  of  the  foUowing  lemma: 


Lemma  1  Let  I  <  k  <  £  <  n  and  p  >  0,  and  let  w  be  any  commutative  word. 
(a)k,/     Let  Ui  =  u;Qstart^o'^?^i>[iiic]Ffc^[pass]F4inc].   Then  Uj  A  u[  where 
u'l  =  wQiij,iahA^f'^'^''^Fi^k[dec]Fk,([pa.ss]Fe[inc] 

(b)k/     Let  vi  =  wQatartAoB^  '    'Fi,ifc[inc]Fjt,4pass]F4dec] .   Then  Vi  A  v\  where 

v[  =  u;(5fiiii3h-B?Fi,fc[dec]FA:,4pass]Ff[dec] 

Proof:  First  let  k  =  I. 
(a)i/  wQataTtAiA^fFi_t\p3^ss]Fi[inc] 

^-^     wQ,,^Ai-'BoA^,^'F,,tb^ss]Ff[iTic] 
:        (apply  (Si)  for  J  —  1  more  times) 

(5y     zx;Q,tartSo^Ar'^'^Fi,,[pass]F4inc] 

^-S     u;gfi„i3hAr'^'^i^i,apass]F,[inc] 


(b)i/  tx;Q,tart/1^5r'^'^Fi,,[pass]F,[dec] 

^-^     u;Q3tartA^-^Bo5r^'^~''Fi^[pass]F,[dec] 
:        (apply  (S2)  for  d  —  1  more  times) 


(53     u;Q3,^Bo^BPF,.,[pass]F,[dec] 


(S3} 


"'<5fini3h-6^Fi,,[pass]F;[dec] 


Now  assume  fc  >  1. 
(a)./ 


u\ 


tf(5startv4o^/Fi,it[inc]Ffc,4pass]F/[inc]  =     Uj 

(Fl)  u;QBtartA^yl?_'r'^A^Fi,,_i[inc]Ffc_,[pass]Ffc,,[pass]F,[inc]  =     U3 

* 
(F2)  u;Q3tartAoM£-^^-^BLiAr^('-^)Fi,,_i[inc]n_a,,[pass]F,[inc]     =     U5 

* 
:  (by  e(/:  —  1)  —  2  more  applications  of  last  2  steps) 

(F3)  u;QstartA^BlL'r'^Ar'^'^^i,fc-i[inc]F,_i[dec]Fi,,[pass]F,[inc]      =     ug 

ind(6),_i,fc_i       ^^Q,i^i3^A^^^'^Fl,fc_a[dec]F,_a[dec]F,,,[pass]F,[inc] 

Note:  The  inductive  invocations  of  the  lemma  are  labeled  "ind(a)"  or  "ind(b)". 

(b)A:,/  We  omit  the  similar  derivation  for  this  case:  the  only  difference  is  that  all  but  the 
first  inductive  invocation  turns  from  "ind(a)"  to  "ind(b)".  Q.E.D. 

We  shall  call  the  derivations  expUcitly  given  in  the  proof  of  this  lemma  the  "standard 
derivations"  (at  level  k).  In  the  next  section  we  prove  that  these  are  essentially  unique. 

4      Uniqueness  properties 


If  we  apply  lemma  1  to  the  initial  assertion  t^o  with  the  parameters  k  =  £  =  n,  p  =  0, 
then  we  get  wq  -—^  Woo  where 

Woo  :=  gfi^i8hA;("^Fi,n[dec]F„[inc]. 

No  rule  (except  the  reverse  of  rule  (S3))  is  now  apphcable  to  Woo-  Thus  w^o  is  a  terminal 
assertion,  that  is,  if  it  occurs  in  any  simple  derivation  starting  from  wq  then  it  must  be 
the  last  assertion  in  the  derivation.  Such  a  derivation  can  be  shown  to  be  essentially 
the  longest  simple  derivation  from  wq. 


Let  us  observe  the  effects  of  using  the  reverse  rules  of  the  system  So  in  derivations  that 
start  from  Wq.  Clearly,  something  interesting  can  only  happen  if  we  apply  the  reverse 
of  rule  (Fl)  to  a  word  that  has  just  been  derived  using  rule  (F2): 

wQfijiiBhFi,k[dec]Fk[pa.ss\Ak  i^"^'   ,         iy(58tart>io^i.*[inc]Ffc[pass]5fc 

reverse(Fl)       yjQ,^,^^F,,k[dec]Fk[inc]Bk 

reverse(S3j      wQ^^^^B^F.^dec]  Feline]  B^ 

\i  k  >  I  then  we  can  continue  for  d  more  steps,  applying  reverse  of  rule  (S2)  to  finally 
yield 

wQBt^tAiAf^F,,k[dec]Fk[inc]Bk. 


U  k  =  1,  we  can  continue  for  at  most  d  steps,  depending  on  the  Ai -degree.  In  any 
case,  the  transitions  after  applying  reverse  of  (Fl)  are  all  forced  transitions  and  they  are 
reverse  transitions.  This  gives  us  the  following  observation:  In  any  simple  derivation 
starting  from  wq,  the  reverse  transitions  (if  any)  occur  contiguously  at  the  end  of  the 
derivation. 

Recall  that  all  assertions  are  well-formed.  We  now  define  a  commutative  word  w  to  be 
very  well-formed  if 


(1)  w  IS  well-formed. 

(2)  deg^^(u;)  -|-  degg^(u;)  is  either  0  or  d. 

(3)  For  all  k  =  l,...,n, 

if  Ffc[inc]  I  w  then  deg^^(ti;)  <  e{k),degg^{w)  =  0; 
if  Ffc[pass]  I  w  then  deg^^(ti;)  +  degB^{w)  =  e{ky, 
if  Ffc[dec]  I  w  then  deg^  (w)  =  0  ,  degg  (w)  <  e{k). 


We  observe  that  the  standard  derivations  starting  from  wq  involve  only  very  well-formed 
assertions.  The  next  lemma  shows  that  such  derivations  are  unique  under  suitable 
conditions. 

For  fc  =  0, 1, . . .  ,n,  and  any  word  iw,  let 

degju;)  =  deg^^(u;)  +  degg^{w). 
Define  the  depth  of  w  to  be  the  smallest  k  such  deg^(u;)  >  0). 
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Lemma  2  (Basic  Lemma)   Let  I  <  k  <  £  <  n,  and  p  >  0  and  let  w  be  a  commutative 
word  of  depth  >  k. 

(a)  Let  ui  =  wQstajrt-A.oA^Fi^k['^nc]Fk/[pa.ss]Ft[inc]  be  well-formed.  Let  Di  be  a 
simple  derivation  from  Ui  to  u\  where  u\  is  the  only  word  in  Di  divisible  by 
<5finish-Pi,fc[ciec].  Then 

Moreover,  D\   is  unique,  and  the  last  transition  in  D\  is  an  application  of  rule 
(S3). 

(b)  Let  vi  =  vjQs,tajrtAQB^Fi^k[inc]Fk,([pa.ss]Fi[dec]  be  well-formed  with  degg^(iy)  = 
0.  Let  D2  be  a  simple  derivation  from  Vi  to  v[  where  v[  is  the  only  word  in  D2 
divisible  by  (5finish-f^i,t[dec].  Then  p  >  e{k)  and 

Moreover,  D2  is  unique,  and  the  last  transition  in  D2  is  an  application  of  rule 
(S3). 


Proof 

First  let  fc  =  1.  We  shall  be  referring  to  the  "standard  derivation"  as  used  in  the  proof 
of  lemma  1 . 


(a)  Since  the  depth  oi  wis  >  k  >  1.,  degg  (ui)  =  0.  Moreover,  since  Uj  is  well-formed, 
w  contains  no  flag  variables  at  level  <  £.  Thus  the  first  applicable  rule  (Sl)^  in  the 
standard  derivation  is  forced.  The  next  d—1  applications  of  (Si)  is  also  forced 
(since  we  allow  only  simple  derivations.  Then  the  application  of  (S3)  is  also  forced. 
Call  the  result  u".  Now  Qftnish-f'i.ifdec]  =  Qfinish  divides  u".  It  is  now  evident 
that  u"  =  Uj  and  has  the  form  stipulated  in  the  lemma. 

(b)  The  proof  is  similar. 
Now  assume  k  >  1 


(a)  Since  Qfinish^i,fc[dec]  ]  u[,  there  must  be  a  first  word  u"  in  the  derivation  Di  in 
which  (5iiniBh-f\,*:-i[dec]  I  u".  By  the  induction  hypothesis  (a),  u"  has  the  form  of 
U2  in  the  proof  of  lemma  1: 

U2  =  u;Qfini8hA^L\~^^i^i,fc-i[dec]F;t_i[inc]Ffc^[pass]f^[inc]. 


Now  the  only  rule  applicable  to  U2  is  (Fl)  which  yields  U3  as  in  the  standard 
derivation.  (Note  that  the  reverse  of  rule  (S3)  is  also  applicable  to  U2,  but  the 
induction  hypothesis  (a)  implies  that  this  would  make  Di  non-simple.)  Since 
<5finish-f'i,fc-i[dec]  I  u[  but  not  U3,  we  again  see  that  there  is  a  first  word  u\^'  after 

U3  in  Di  such  that  (^finish-^i.t-ii'^^'^]  I  "1    •  By  the  induction  hypothesis  (a),  u\ 
must  have  the  form  u^  as  in  the  standard  derivation.  Now,  we  see  that  either  rule 
(F2)  or  (F3)  is  apphcable  to  U4  (the  reverse  of  rule  (S3)  is  apphcable  but  excluded 
by  repetition- freeness). 

If  (F2)  were  applied  to  U4  we  get  U5  as  in  the  standard  derivation.  Of  course,  we 
can  now  apply  the  reverse  of  (Fl)  although  we  already  saw  that  this  will  not  lead 
to  anything  interesting,  and  certainly  not  to  u[.  Hence  (by  induction  hypothesis 
(a))  we  might  as  well  assume  that  (F2)  is  followed  by  the  standard  derivation  at 
level  A:  —  1.  In  general,  suppose  we  apply  (F2)  followed  by  the  standard  derivation 
at  level  /:  —  1  for  a  total  of  m  >  0  times  to  U4.  The  final  result  is: 

Clearly  m  <  e{k  —  1).  However,  if  m  =  e{k  —  1)  then  we  are  stuck  since  no  rule 
applies.  So  we  assume  m  <  e{k  —  1)  —  1.  Only  rule  (F3)  is  applicable  to  u\  , 
yielding 

ul''  =  u;Q3tart/l^A:L\-'^-'"-'fir--V4"^'''*'"''^i.fc-i[inc]F,_adec]F,.,[pass]F4i^^^ 

Again  there  must  be  a  first  u\  in  Dx  after  u\  '  where  Qfinish^i.it-ildec]  |  u\  . 
By  induction  hypothesis  (b),  m  -f  1  >  e{k  —  1)  and  hence  m  +  1  =  e(^-  —  !)•  This 
shows  that  u\  =  u-j^u\  =  ug  and  u\  =  u\.  By  induction  hypothesis  (b),  the 
last  transition  in  the  derivation  u\  A  u\  is  an  application  of  rule  (S3),  and  hence 
the  same  holds  for  D\ . 

(b)  This  proof  is  similar  to  part  (a).  Q.E.D. 


5      Expanded  rules 


The  size  of  rule  (a,^)  is  max{deg(a),deg(/?)}.  The  rules  (a,/?)  in  the  Thue  system  So 
can  have  size  linear'  in  n,  i.e., 

|q|  =(f  +  Q(n)      or      |/3|  =  J-f-fi(n). 

In  this  section,  we  convert  Sq  to  an  essentially  equivalent  system  S\  whose  rules  have 
constant  sizes,  i.e., 

|a|  =  cf+0(l),    1^1  =  ^  +  0(1). 


^VVe  deem  the  size  dependence  on  cf  as  natural:  in  fact,  d  is  usually  viewed  as  the  maximum  degree 
of  the  rules. 
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To  achieve  this,  we  introduce  n  +  1  new  variables 

Lq,Li,L2,  . . .  ,Ln 

called  level  variables.    Let  Ei  =  EoU{-^o,  •  •  •  ,Ln}.    The  start  rules  (Si)  and  (S2)  are 
replaced  by: 

(Tl)  Lo  Qst^Ap  Xi 

(T2)  Lk  Qst^tFk[pass]Ao  i^^^^  (^b  =  1, . . .  ,n  -  1) 

(T3)  LkAo  QstartF.linc]  lo^o/l^  (ifc  =  l,...,n) 

(T4)  LkBi  <?3tartFfe[dec]^  ^qBo  (A:  =  l,...,n) 

Clearly,  the  rules  (Si)  and  (S2)  can  be  simulated  by  the  new  rules  (Tl)-(T4).    For 
instance,  rule  (Si)  is  simulated  by  the  sequence 

Qstart  Q start  i^l  [pass]  ^  ^  (?starti^fc[ilic]        , 

lo/io  -(^^1^ — (r2r^^^^°T^---(^^^^° — jT^r^  ^'^'^'- 

Next  we  convert  rule  (S3)  to 

(T5)  Vstartio-Bo  >  <5fiaish-^l- 

Similarly,  the  finish  rules  (Fl)-(F3)  becomes,  for  A;  =  1,2,. . .  ,n  —  1: 

(Gl)     Itn[dec]  ^finish      Lk+iFk[iTic] 

(G2)     Qfi^ish^jfei^Jtlinc]  .       gstartJ^o>l^Fjt[pass] 

(G3)       Qfinishi^iAjt  ^^[P^^^j       Qstarti^oA^Sfc 

(G4)     (3fi.iish^fc^*[pass]Ai       ►       Qsx^LQA^Fk[dQc]Bk 

Again,  each  of  the  finish  rules  can  be  simulated  by  (Gl)-(G4).     For  instance,  (F2) 
becomes,  for  k  >2: 

Qfinish-^i-F'i,fc[dec]FA:[pass]Ajt     (Gl)  Qiini3hL2Fx[iTxc]F2,k\^Qc]Fk[pa.ss]Ak 

:  {k  —2)  more  applications  of  (Gl) 

(Gl)  Qfiiii3h^;t^i.*[inc]F*[pass]/ljt 

(G2_)  gstarti^o>lo^i,Jt[inc]F^[pass]5i 
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Let  the  Thue  system  Si  over  Ei  refer  to  the  set  of  rules  (Tl)-(T5)  ,  (Gl)-(G4)  and  their 
reverses.  Evidently,  we  intend  well-formed  words  of  Si  to  be  linear  in  {Lq,  Li,  . . .  ,/,„}. 
We  define  a  word  to  ty  €  Ef  to  be  {very)  well-formed  if  and  only  if  u;  =  Lkw'  for  some 
k  —  0, . . .  ,n  and  w'  G  E®  's  (very)  well-formed  in  the  original  sense.  If  lo  €  S®  is  a 
well-formed  word  such  that  either 

<33tart-^0  I  W  or  (^finish^l   |  ^ 

then  we  call  u>  a  standard  word. 

Clearly,  there  is  a  one-one  map  <i>  that  takes  each  well-formed  word  w'  €  E®  to  a 
standard  word  <f>{iv')  G  E®.  For  instance,  recall  the  assertions  Wq  and  Woo  from  the 
previous  section.  Then  4>{wq)  is  the  initial  assertion  of  Si  and  we  have: 


<t>{wo)   ■■=  QstBiTtLoA^Fi^n+ilinc] 


<f>{woo)   :=   gfinishIiA;(")Fi,„[dec]F„[inc] 

Definition.  Given  two  Thue  systems  (5",  E)  and  (5',  E'),  we  say  that  S  is  embedded  in 
S'  via  a  map  (f>  ii  4>  is  a.  one-one  function  from  the  assertions  of  S  to  the  assertions  of  S' 
such  that  the  initial  assertion  Wq  of  5  is  mapped  to  the  initial  assertion  w'q  of  S'  by  <f>, 
and  the  following  holds.  First,  call  an  assertion  of  5'  standard  if  it  is  of  the  form  (t>{w) 
for  some  w.  Then: 


(a)  If  u  — ►  ly  is  a  transition  of  5  where  u  is  an  assertion,  then  there  is  a  corresponding 

unique  derivation 

D'  =  {4>{u),w[,...,w'^,<l>{w)) 

of  S'  where  u;J, . . .  ,w!^  are  all  non-standard  words. 

(b)  Conversely,  if  D'  =  {(f>{u),w[,. . .  ,w'^,(^{w))  is  a  derivation  of  5'  where  u  is  an 

assertion  of  S  and  w'^,. . .  ,w'^  are  non-standard  words  then  u  — >  u;  is  a  transition 
of  5. 

We  then  have  the  following  result. 


Lemma  3   The  system  So  from  the  previous  section  is  embedded  in  the  system  Si  in 
this  section,  via  the  map  (f>  defined  above. 


We  omit  the  easy  proof.    In  a  natural  way,  all  the  properties  (such  as  uniqueness)  we 
have  proved  for  5*0  transfers  to  5*1. 
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6      Winding  down,  cleaning  up 


The  system  Si,  with  some  trivial  modifications,  counts  to  e(n)  in  the  sense  of  the 
introduction.  In  applications,  we  also  want  the  reverse  process,  to  "count"  down  from 
e(n).  It  is  convenient  to  further  modify  the  system  so  that  the  initial  assertion  is  a  single 
variable  Qo  and  for  the  standard  derivation  (by  definition)  to  eventually  yield  a  single 
variable  Qoo- 

By  winding  down  here,  we  mean  the  addition  of  rules  which  allow  the  assertion 


to  get  rid  of  all  the  A^s  and  to  reach  some  definite  word  w'^.  This  is  easy  to  do  in  Si. 
Recall  that  the  finish  rules  (Gl)-(G4)  do  not  apply  for  k  =  n;  the  new  rules  cover  this 
case. 

Winding  Down  Rules 

(Wl)     F„[inc]  L„  F„[pass] 

(W2)     An  XnF„[pass]      Br, 

(W3)     Qfinishin^n[pass]     ►     QstaitLoA^Fn[dec] 


These  additional  rules  allow  the  standard  derivation  (t>{wo)  -—*  4>{woc,)  to  be  extended 
as  shown  here: 


<l>{w^)       -^       Qfinishi^ni^i,n[inc]A;(")F„[inc]        (by  n  -  1  applications  of  (Gl)) 

:  (apply  (W2)  for  e(n)  times) 

CW2)      Q,i,i3i,L„F,,„[inc]B^(")F„[pass] 
(^      g8tartioA^Fi,„[inc]B^(")F„[dec] 

From  this  last  word,  by  the  analogue  of  lemma  1(b),  we  eventually  derive 

w'^  =  QfinishLiFi^n+i[dec]. 
Moreover,  by  the  usual  arguments,  this  derivation  from  4>{wo)  to  w'^  is  unique. 
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Next,  to  allow  the  system  to  have  unique  starting  and  final  variables,  we  set 

s2  =  E,u{Qo,go,QooC?:„} 

and  introduce  the  following  rules. 
Initialization  Rules 

(II)  go      —  Q'oL^Ai 

(12)  L,Q'^     -^     g^L,+iF,[inc]         (A:  =  1, . . .  ,n  -  1) 

(13)  L^Q'^     -^     QstartioFniinc] 

Clearly  these  rules  imply  Qq  A  <i){wQ).  Similarly  the  next  set  of  rules  imply  w'^  -^  Q^o- 
Cleaning  Up  Rules 

(Cl)       gfinish-^l  *       Q'oo^i 

(C2)     Q'^LkFk[dec]     -^     Q'^L,^,         (/:  =  1, . . .  ,n  -  1) 
(C3)     g:^L„F„[dec]     —     Q^ 

Note  that  rule  (Cl)  if  applied  prematurely  (i.e.  when  not  applied  to  w'^)  will  get  'stuck' 
but  in  any  case  will  not  yield  Q^o- 

Call  the  Thue  system  just  described  (52,  II2).  The  rules  of  S2  are  explicitly  given  by  the 
rules 

(T1)-(T5),  (G1)-(G4),  (W1)-(W3),  (Il)-(I3)  and  (Cl)-(C3). 

We  saw  that  the  following  repetition-free  derivation  of  S2  is  unique: 

The  number  of  variables  in  E2  is  6n  -f  0(1).  However,  we  can  easily  reduce  this  to 
4n  +  0(1)  by  observing  that  the  four  variables 

Ft[inc],  FA:[pass],  Ffc[dec],  Lk 

at  level  k  can  easily  be  encoded  by  introducing  two  new  variables  Fk,Gk  and  using  the 
6  words  FkGl~*  for  i  =  0, . . .  ,5:  for  instance  Lk  is  either  present  or  not  in  a  well-formed 
word,  and  this  can  be  encoded  by  the  parity  (even  or  odd)  of :,  and  the  three  colors  can 
be  associated  with  the  values  [j/2j  =  0,1,2.  The  modifications  to  the  rules  of  52  are 
trivial.  Call  this  system  (53,1)3). 
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7     A  technique  for  encoding  flag  systems 

Let  E  be  a  set  of  commutative  variables  and  S  a  Thue  system  over  E.  We  say  5  has 
parameters  (6,  n,(f)  if  S  has  <  bn  rules  and  each  rule  has  size  <  d-^b.  (Note:  previously, 
b  has  been  hidden  in  the  big-Oh  notation.)  We  call  a  subset  F  C  E  a  system  of  flags  for 
S  (relative  to  {b,n,d))  provided: 

(1)  Each  rule  (a,/3)  G  S  satisfies  degp(a^)  <  6. 

(2)  Each  assertion  w  of  S  satisfies  degp{w)  <  b  for  each  F  G  F. 

In  this  section  we  show  this  general  result: 

Theorem  1  There  is  a  universal  constant  c  >  0  such  that  the  following  holds.  Let  S 
over  E  have  parameters  {b,n,d)  and  F  C  E.  is  a  system  of  flags  for  S.  Then  S  can  be 
embedded  in  a  Thue  system  S'  over  E'  where  S'  has  parameters  {cb,n.,d)  and 

E'  =  (E  -  F)  U  F' 
and  |F'|  <  cb^J\F\. 


In  other  words,  we  can  replace  the  system  of  flags  F  with  a  new  set  F'  that  is  roughly 
square-root  the  original  size. 

We  can  apply  this  result  to  the  system  S2  that  has  parameters  (6,  n,  d)  for  some  small 
b  <  10.  The  set 

Fi  =  {F>:[inc],Ft[pass],Ffc[dec]   :   k  =  l,...,n} 

is  a  system  of  flags  for  52-  But  so  is  F2  =  {Li, . . . ,  L„},  and  the  union  of  two  systems 
of  flags  is  also  a  system  of  flags  (perhaps  using  a  difi"erent  b  constant).  Applying  this 
theorem  to  F  =  Fj  U  F2,  we  obtain  a  system  (call  it)  Soo  over  a  set  of  2n  +  0{y/n) 
variables,  such  that  ^2  is  embedded  in  it.  If  52  is  embedded  in  5oo  via  the  map  <f>  then 
the  derivation 

HQo)  ^  <l>{Qoo)      (mod5oo) 

is  unique.  With  trivial  modifications  to  Soo  to  satisfy  the  concept  of  counting  as  formal- 
ized in  the  introduction,  we  may  conclude  the  main  result  of  this  paper: 

Theorem  2  There  is  a  constant  Cq  >  0  such  that  for  all  n  >  l,d  >  2,  there  exists 
a  commutative  Thue  system  {Sn,d,^n,d)  such  that  Sn,d  counts  to  ed(n)  =  d^  ,  |E„(i|  < 
2n  +  Coy/n,  \Sn,d\  <  Con,  each  rule  in  Snj  has  size  <  d  +  Cq. 
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In  the  rest  of  this  section,  we  will  prove  theorem  1.  It  is  simpler  to  assume  that  |r|  +  1 
is  a  perfect  square, 

for  some  integer  r  >  0.  We  define  the  integer  functions  fi,  v  where  for  any  integer  A:, 


v{k)  =  k  mod  r. 
We  shall  use  these  functions  for  fc  —  1,. . . ,  |r|;  in  this  case  k  =  fi{k)r  +  u{k). 

Encoding  of  flag  information  by  groups.  Introduce  2(r  +  4)  variables 

D'   D"  n'       D" 

to  encode  the  information  of  the  flag  variables  F.  We  partition  T  into  r  groups,  T  = 
Ti  U  r2  U  •  •  •  U  Tr  with 

r,   =   {F(,_i)r  +  1,  F(,_i)r  +  2,  •  ■  •  ,  fir} 

where 

r  =  {Fi,F2,...,F|r|}. 

Let  u  €  rf  and  for  each  F(,_i)r+j  €  F,-,  let  the  F(,_i)r+j-degree  of  v  be  rij.  We  assume 
that  Uj  <  b  where  6  is  the  constant  in  the  statement  of  theorem  1.  We  encode  such  a  v 
by  a  word  in  {D[,D"}^  as  follows.  Let 

n{v)  =  J:n,b'-\ 

(Remark:  we  shall  call  rii  the  zero-th  (lowest)  digit  of  the  integer  n{v),  n2  the  first  digit, 
and  so  on.)  Notice  that  n{v)  <  V  and  so  we  can  encode  v  by  the  word 

D,[n{v)\  =  (d;)"('''(D,")'''~"*"^ 


The  embedding  map  <)>.  Now  if  i;iV2  •  •  •  v^w  is  an  assertion  of  S  where  degp(u;)  =  0 
and  Vi  G  Ff  then  we  define  the  map  4>  by 

4>[Vx  ■  ■  ■  VrW)  =  Qstandard^lhl]  "  "  "  Dr[nr\w 

where  (^standard  is  a  new  state  variable,  n,  =  n(y,)  is  the  integer  encoding  the  degrees 
of  the  variables  in  F,,  obtained  as  in  n{v)  above.  We  call  a  word  of  the  form  <^(u;)  a 
standard  word  of  S'  if  w  is  an  aissertion  of  5.  This  function  4>  will  embed  S  in  the  system 
5'  to  be  constructed. 
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Each  pair  {D'-,D"]  forms  what  will  be  referred  to  as  the  Di[  ]-counter.  Such  a  counter 
encodes  an  integer  0  <  m  <  6*"  a5  (Z)-)'"(Z),")'''^~'".  Thus  it  is  easy  to  detect  if  the  value 
of  such  a  counter  is  zero:  we  use  a  b-aiy  counter  that  counts  up  to  6*".  More  on  6-ary 
counters  below.  It  is  importcmt  to  realize  that  the  map  <f>  is  one-one  only  for  those  words 
w  which  satisfy  degp{w)  <  b  for  each  F  G  F;  by  definition  of  flag  systems,  this  is  the 
case  when  w  is  an  assertion.  Notice  that  the  Dh[  ]-counters  (/i  =  0,  r  -f  1, . . . ,  r  +  3)  are 
unused  at  this  point. 

The  transformed  rules.  For  each  rule  (R)  of  S  we  introduce  a  transformed  rule  (R)' 
of  S'  as  follows.  Let  (R)  have  the  form 

(R)     (77  -^   t6 

where  ctt  6  F®  and  degp(76)  =  0.  Then  its  transformation  (R)'  is: 

(R)'      QstandardT    "^    Qstep^E[cr,T] 

where  Qstep  is  a  new  state  variable  and  £^[(7,  r]  is  a  special  encoding  of  ((7,  t)  to  be  de- 
scribed next.  The  state  Qstep  in  (R)'  indicates  that  we  are  now  simulating  one  transition 
step  of  5. 

The  encoding  E[(7,t].  Clearly  deg(crr)  <  b.  So  <7,  t  has  the  form 

where  1  <  /ii  <  /i2  <  •  •  •  <  /ip  <  |F|,  Yl^^i  "^t  +  n,-  <  6  and  each  rrii  +  Ui  >  1.  Let  us 
note  that  the  triple  (m,,ni,/i,)  in  this  context  amounts  to  the  foUowing  'instruction' 

^(m,,n.)|^^j  .  i^Yie  Fk,-degvee  of  the  current  word  must  be  at  least  m,  for  the  application 
of  the  rule  (R),  and  this  F;,,- degree  should  be  incremented  by  n,  —  m,  after  the 
application  of  rule  (R)'. 

To  encode  such  instructions,  we  introduce  2(6  -f  l)^r  new  variables 

for  i,jj'  =  0,...,6.  For  any  k  =  l,...,|r|,  and  iJJ'  =  0,...,6,  let  ul^'^'^k]  and 
N^       [k]  be  shorthand  for  the  variables 

Mi''\k]   :=   M^;^, 
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Also,  let 

We  intend  for  the  word  MN^'  [k]  to  encode  the  instruction  Z^^'^'\k]  (the  role  of  i  will 
be  clearer  below). 

The  encoding  of  cr,  r  can  now  be  given. 

The  presence  of  MA'^p^i  [1]  simply  encodes  the  value  of  p  in  the  definition  of  a,  t;  this 
encoding  makes  sense  since  p  <h  and  m^  +  "i  <  6. 

Remark.  Note  that  rule  (R)'  may  be  applicable  to  a  word  (f){w)  even  though  rule  (R) 
is  not  applicable  to  w  (this  happens  when  7  ^  1  in  rule  (R)).  It  will  turn  out  that  our 
rules  ensure  that  such  unintended  transitions  will  eventually  become  "stuck",  i.e.,  be 
unable  to  return  to  a  standard  word  without  repetition.  On  the  other  hand,  if  rule  (R) 
were  apphcable  to  u;,  the  derivation  will  eventually  return  to  the  standard  \yord  .<^(u;') 
where  w   — >  w'  is  a  transition  of  S  obtained  by  applying  rule  (R). 

Simulation  by  stages.  We  carry  out  the  simulation  of  a  step  in  stages  (in  fact  there 
are  p  stages  if  the  rule  to  be  simulated  ha^  a,T  as  above).  At  the  beginning  of  each 
stage,  the  current  word  will  in  general  contain  the  catcdyst  word 

MNi^^l\l]MN!''''\k]  (1) 

for  some  i,j,j',k,  indicating  that 

(a)  We  are  in  stage  i 

(b)  We  want  to  cairry  out  instruction  Z^^-^'^[k]. 

When  done,  we  proceed  to  the  next  stage  by  transforming  MNl^\[l]MNl^'^'[k]  to 
M N}  '  [1].  When  we  finally  reach  reach  stage  0  as  indicated  by  the  presence  of 
MTVJ  ■  ^[1],  we  know  that  we  had  successfully  checked  and  transformed  all  the  flag  values 
necessary  for  the  application  of  rule  (R).  We  may  change  the  state  from  Qstep  back  to 

Vstandard- 

In  the  rest  of  this  simulation,  we  assume  that  the  integers  i,j^j',k  have  been  fixed  by 
the  occurence  of  the  catalyst  (1).  Let  us  recall  that  the  Fk-degree  of  w  is  encoded  as  the 
^{ky^  digit  of  the  Z?^(fc)-degree  in  <p{w).  If  m  is  this  D^^j-degree,  then  the  Ffdegree  is 
given  by 

[iSyJ  ( "^°^  *)• 
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Each  stage  consists  of  seven  phases. 

Phase  0.  Initialization. 

We  initialize  the  auxilliary  counters  !)/,[  ]-counters  for  /i  =  r  -|- 1,  r  +  2,  r  +  3  to  zero.  We 
transfer  the  contents  of  the  D^f^k)[  ]-counter  to  the  Do[  ]-counter  and  set  up  appropriate 
"level  variables"  to  indicate  that  the  6-ary  counter  is  to  be  u{k)  digits  long  only.  This 
ensures  that  the  rules  of  the  remaining  phases  (except  for  phase  6)  are  independent^  of 
n{k)  and  v{k). 

Note  on  fe-ary  counters.  The  reader  should  realize  that  the  ability  to  initialize  coun- 
ters and  transfer  the  contents  of  a  counter  to  another  depends  the  fact  that  we  caji  set 
up  6-ary  counters  that  'count'  to  6*"  (recall  the  introductory  section  of  this  paper).  Since 
these  6-ary  counters  count  to  at  most  6'',  they  are  are  easily  implemented  using  0{br) 
variables.  Associated  with  the  6-ary  counter  are  0{br)  rules  to  increment,  decrement  or 
test-for-zero.  Each  such  rule  has  size  0(1).  When  done  carefully  (e.g.,  by  introducing 
state  variables  to  indicate  that  the  6-ary  counter  should  be  incremented,  etc),  these  rules 
can  also  be  shared  by  all  the  phases.  Hence,  we  only  need  one  6-ary  counter.  The  details 
of  such  a  construction  are  left  as  an  exercise. 

Phase  1.    Computes  some  i  <   [m/6''(*')j   in  the  Dr+i[  ]-counter,  and  decrements  the 

Do[]-counter  by  Ib^^^K 

Recall  that  m  (the  Z)^(fc)-degree  of  w)  is  encoded  by  the  word 

Do[m]  =  {DXiD'^f-""- 

In  this  phase,  we  will  reduce  the  Do[  ]-counter  by  some  ^  >  0  multiples  of  6"^''',  and 
store  the  value  of  ^  in  the  previously  unused  counter  Dr+i[  ].  This  is  easily  done,  using  a 
6-ary  counter  that  counts  to  6"^'"^  where  each  increment  of  the  6-ajy  counter  is  coupled 
with  a  decrement  of  the  Do[  ]-counter.  Whenever  the  6-axy  counter  returns  to  0,  we 
have  the  choice  of  going  to  the  next  phase  or  of  decrementing  Do[  ]-counter  by  another 
6"^''';  in  the  latter  case,  we  increment  the  value  of  £  in  the  Dr+i[  ]-counter.  K  £  exceeds 
[Tn/b''^'''\,  we  become  stuck  in  a  state  from  which  there  is  no  return  to  a  standard  word. 

Phase  2.  Transfers  the  residual  value  of  the  Do[]-counter  to  the  Dr^2[]- counter. 
We  want  to  check  that  £  is  exactly  equal  to  [m/6''(*^^J.  This  amounts  to  checking  that 
the  Do[  ]-counter  has  a  residual  value  less  than  6"^*^^  So  we  use  the  6-ary  counter  of 
phase  1  to  continue  decrementing  the  Do[  ]-counter.  To  save  the  residual  value  of  the 
Do[  ]-counter,  we  simultaneously  increment  the  Dr+2[  ]-counter,  which  is  unused  until 
now.  Before  each  decrement  of  the  Do[  ]-counter,  we  have  the  choice  of  continuing  to 
decrement  or  to  go  to  phase  3.  If  we  succeeded  in  decrementing  6"^'^^  times,  then  we 
enter  a  stuck  state  {£  is  too  small). 


^If  the  phases  are  not  made  independent  of  ft{k)  and  i/{k)  then  the  number  of  rules  simply  grows 
by  a  factor  of  r. 
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Phase  3.   Detects  if  the  transfer  to  /^r+jf  ]-counter  is  complete  and  thus  confirms  that 

i  =  [m/i-^*)]  . 

To  do  this,  we  simply  verify  that  the  Do[  j-counter  is  zero  (otherwise  we  should  get 

stuck). 

Phase  4.  Computes  some  £'  <  [i/b\  in  Dr+af  ]-counter  and  decrements  i  in  the  Dr+i[  ]- 
counter  by  £'b. 

We  now  have  the  value  of  £  =  [m/b^^''^\  in  the  Dr+i[  ]-counter.  The  F^-degree  we  seek 
is  i  mod  b.  Similar  to  phase  1,  we  now  subtract  some  £'  >  0  multiples  of  b  from  the 
Dr+i[  ]-counter,  and  keep  track  of  £'  in  the  (hitherto  unused)  jDr+af  ]-counter.  If  £'  gets 
larger  than  \_i/b\,  we  get  stuck.  Otherwise,  we  reach  phase  5. 

Phase  5.   Checks  that  £'  =  [£/b\  and  computes  the  Fk-degree 

Again,  we  must  check  if  £'  =  [£/b\.  This  is  accomphshed  in  one  step  by  subtracting 
some  integer  _;'"  =  0, . . . ,  6  —  1  (nondeterministically)  from  the  Dr+\[  ]-counter  and  then 
checking  if  the  Dr+i[  ]-counter  is  now  zero.  If  the  counter  is  indeed  zero,  then  j"  is  the 
Ffc-degree.  Otherwise  we  get  stuck. 

Phase  6.  Checks  that  Fk-degree  is  sufficient  for  applying  the  current  rule  and  carries 
out  the  transformation  of  the  Fk-degree. 

Now  we  check  that  j"  discovered  in  the  previous  phase  is  indeed  >  j,  and  if  so,  we  make 
the  value  of  the  Dr+i[  ]-counter  equal  to  j"  +  {j'  —  j).  We  must  now  restore  the  value 
of  the  i5^(fc)[  ]-counter,  using  this  formula 

where  d^  refers  to  the  value  in  the  Dh[  ]-counter.  The  details  are  left  to  the  reader. 

This  completes  the  description  of  the  simulation  of  a  transition  step.  Let  us  tabulate 
the  statistics  of  the  system  S'. 

Variables.  There  are  0{b'^r)  new  variables  explicitly  described  above.  We  need  another 
0{br)  variables  to  implement  the  6-ary  counters.  Of  course,  there  are  also  0(1) 
state  variables  to  remember  each  part  of  each  phase. 

Rules.  We  have  one  transformed  rule  for  each  of  the  rule  of  5;  these  rules  have  size  at 
most  <f+36+0(l)  (cf.  (R)'  above).  We  introduced  rules  for  the  7  phases.  In  phases 
0  and  6,  we  need  to  transfer  values  between  the  Do[  ]-counter  and  the  appropriate 
Di[  ]-counter.  This  can  be  done  with  0{r)  rules  of  size  0(1).  Not  counting  the 
rules  for  the  6-ary  counters,  most  of  the  remaining  rules  amount  to  incrementing 
or  decrementing  the  Z)^[  ]-counters  for  /i  =  0,r  +  1, . . .  ,r  +  3;  there  are  0(1)  such 
rules,  each  of  size  0(1).  In  phases  5,  we  have  b  rules  of  size  0(6),  corresponding 
to  guessing  the  value  of  j".  Finally,  the  number  of  rules  for  the  6-ary  counter  and 
their  sizes  are  0(6r)  and  0(1),  respectively,  as  already  explained. 
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This  completes  the  proof  of  theorem  1. 

Final  remarks.  We  have  not  attempted  to  optimize  the  constant  factors  in  this  con- 
struction. For  example,  we  note  that  the  0{b^r)  MA'^-variables  in  turn  form  a  flag 
system  for  the  system  5"',  so  the  theorem  1  can  be  applied  again.  Hence  we  could  actu- 
ally get  away  with  only  |r'|  =  0{br)  new  variables  (2r  variables  for  the  Z),[  ]-counters 
and  0{br)  for  the  binary  counters).  Another  idea  that  can  improve  constant  factors 
is  that  some  flag  systems  (such  as  the  level  variables  Li,...,L„)  actually  satisfy  the 
stronger  property  that  each  assertion  has  bounded  degree  in  F,  not  just  bounded  degree 
in  each  F  G  F.  In  that  case,  with  some  extra  effort,  only  O(logn)  variables  suffice  to 
encode  such  flag  systems.  Finally,  for  any  integer  a;  >  3,  it  sufBces  to  use  only  0(|rp/^) 
new  variables  if  we  are  willing  to  replace  the  A/A^-encoding  of  an  integer  by  words  of 
length  X. 


8      Conclusion 


The  two  major  ideas  in  our  construction  is  the  "quadratic  counter"  as  embodied  in  the 
Thue  system  5o,  and  the  technique  of  encoding  flag  systems.  The  obvious  open  question 
is  whether  the  number  of  variables  can  be  further  reduced  below  ~  2n. 

The  system  52  can  be  trivially  modified  to  carry  out  the  Mayr-Meyer  simulation  of 
counter  machines  when  the  counters  are  known  to  be  bounded  by  e(n).  We  do  not 
elaborate  on  this.  (Of  course,  we  could  use  ^3  or  ^oo,  but  we  prefer  52  since  it  is 
exphcitly  given  in  this  paper.)  The  other  application  of  52,  to  show  double  exponential 
degree  of  reduced  Grobner  bases  along  the  hues  of  Huynh  [3]  or  Mora-Moller  [5]  seems 
not  so  easy.^  In  the  following,  we  give  a  simple  proof  that  our  system  demonstrates 
the  double-exponential  degree  behaviour  for  the  ideal  membership  problem  and  syzygy 
problem.  This  kind  of  behaviour  was  first  observed  by  Bayer- Stillman  [ij. 

We  use  the  following  simple  lemma  from  MajT- Meyer  [4]: 

Lemma  4  Given  a  commutative  Thue  system  S  over  the  variables  E,  let  Fs  = 
{/ii  •  •  •  5/m}  Q  Q[S]  be  the  corresponding  set  of  polynomials.  Let  ^,  Q  €  E®.  // 

^-a=^J29Ji         (5.€Q[E]) 

then  there  is  a  derivation 

a  —*  Wj  —y  W2  —*  ■  ■  •  —*  ^{modS) 


^Huynh  provided  only  a  sketchy  justification  of  his  rather  involved  construction;  it  seems  non-trivial 
to  nail  down  the  necessary  properties  of  such  a  system.  Mora-Moller  testify  that  their  "proofs  are  too 
long  to  be  included"  in  the  paper.  Both  situations  should  be  remedied  in  the  sense  of  providing  more 
transparent  constructions  and  proofs. 
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such  that  for  each  j, 

deg{wj)  <  ma.x{deg{gj,)    :   i  =  l,...,m}. 

For  the  system  (52,^2),  we  now  show: 


Theorem  3  Let  F2  =  {fi,-  ■  ■ ,  fm}  Q  QI'^t]  f>e  the  set  of  polynomials  corresponding  to 
the  Thue  system  Sj.  Let  /o  =  Qoo  —  Qo-  Then  /o  €  (-Flz)-  Furthermore,  for  any  sequence 
of  polynomials  g  =  (gi,  •  •  •  ,5m),  gi  ^  Ql^^],  »/ 

m 

/o  =  Y.9Ji 

i-l 

then  deg{g)  =  e{n)  -0{d). 


Proof  Clearly  the  derivation  from  Qo  to  (5oo(raodS2)  implies  /o  £  (F2).  If  /o  =  Y1T=\  9ifx^ 
then  the  cited  lemma  of  Mayr-Meyer  implies  a  derivation  Qo  —^  Wi  -*  W2  —^  ■  ■  ■  —^  Qoo 
where  deg(u)_,)  <  max{deg(5,/,)  :  i  =  l,...m}  for  each  j.  But  this  derivation,  after 
omitting  repetitions,  is  the  standard  derivation  from  Qq  to  Q^o  and  thus  (i>{wao)  occurs 
in  this  derivation.  From  deg{4>{woo))  >  e(n)  and  deg(/,)  =  d  +  0(1)  (i  =  1,. . .  ,m),  the 
result  follows.  Q.E.D. 

This  double  exponential  lower  bound  for  ideal  membership  in  F2  is  easily  turned  into  a 
similar  bound  for  minimal  syzygies.  Consider  the  set  of  polynomials 

^2U{/o)- 

Let  {go,gi, . . .  ,gm)  =  g  he  a.  minimal  syzygy  of  F2[j{fo}  where  ^o  =  1-  Note  that  g 
exists.  Hence  fo=—  YlT=i  9ifi-  Then  the  preceding  result  implies  deg(^)  =  e(n)  —  0{d). 
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